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Abstract 

In this paper we suggest that, under suitable conditions, supervised 
learning can provide the basis to formulate at the microscopic level quan- 
titative questions on the phenotype structure of multicellular organisms. 
The problem of explaining the robustness of the phenotype structure is 
rephrased as a real geometrical problem on a fixed domain. We further 
suggest a generalization of path integrals that reduces the problem of de- 
ciding whether a given molecular network can generate specific phenotypes 
to a numerical property of a robustness function with complex output, for 
which we give heuristic justification. Finally, we use our formalism to 
interpret a pointedly quantitative developmental biology problem on the 
allowed number of pairs of legs in centipedes. 
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1 Introduction 

Leon Ehrenpreis was a singular mathematician. Not only he had a gift and 
a vision for a deep understanding of mathematics, but he had a passion for 
the construction of overarching approaches, that would allow a general com- 
prehension of vast areas of mathematics. This passion is embodied in his two 
masterpieces, Fourier Analysis in Several Complex Variables, [7], and The Uni- 
versality of the Radon Transform, jS], but is also apparent in the many papers 
he published, for example, on wide generalizations of the Edge-of the Wedge 
theorem. 
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Reading Ehrcnpreis' works, we are reminded of a beautiful phrase that 
Kawai, Kashiwara, and Kimura insert in [l6j . just after the proof of the Water- 
melon Theorem: "It was like looking down at the valley after reaching the peak 
of the hill". 

This aspiration to a global view of mathematics, that would offer insights, 
even in advance of a fully realized technical description of that view, is part of 
what made Ehrenpreis work uniquely captivating, and uniquely fertile. Many 
mathematicians, of at least three generations, have worked to understand, for- 
malize, explain, refute, demonstrate, statements in Ehrenpreis work. It is for 
this reason, that Ehrenpreis student, C.A.Berenstein, once noted in his review 
[U of [5] that "a book that is worth studying, although mining may be a more 
appropriate word, as the reader may find the clues to the keys hes searching for 
to open up subjects that are seemingly unrelated to this book. Thus, one finds 
at the end that the title is justified." . 

It is in this spirit, but with a deep sense of humility and with a full awareness 
of our limitations, that we would like to present, in this paper, a proposal, 
a strategy for a way to mathematically understand and describe one of the 
fundamental problems (the fundamental problem?) of modern biology: how can 
we understand macroscopic biological traits from our knowledge of molecular 
level information. 

The proposal we make is inspired by the instrument of path integrals, which 
is probably the most enduring legacy of Richard Feynman [5], and for which 
we suggest here a tentative generalization to provide a plausible tool for the 
description of the macroscopic properties of a biological system. 

Our specific point of departure lies in an important, and somewhat sur- 
prising, fact. In [19] chapter 9 it is reported that, out of all known species of 
centipedes, there are about 1000 species with 15 pairs of legs, none with 17 or 
19 pairs, several with 21 and 23 pairs, and a few distributed over a very large 
range from 27 to 191. No centipedes have an even number of pairs of legs, 
and some species have a stable interspecies number of pairs of legs, while some 
others display a variability of the number of legs among individuals. 

Can we explain in any quantitative way this striking pattern of gaps with 
respect to the dynamics at fine molecular scale? And how do we express the 
remarkable robustness of the resulting phenotypes? To be more specific, we 
state the following 

Problem 1 (Centipedes Segmentation (CS) Problem) Show that it is im- 
possible to have centipedes with an even number of pairs of legs, or with 17 or 
19 pairs of legs. 

Clearly if we can suitably quantify this problem, we will be able to generalize 
the question to the full gap structure that we have described in the previous 
paragraph. Moreover, we take the CS problem simply as emblematic of the va- 
riety of developmental biology problems in which we observe strong constraints 
on the phenotype ([19j. page 86), without having a conceptual frame through 
which to approach these problems. 
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In general, molecular biology approaches to developmental biology problems 
such as the CS problem require the construction of an appropriate map that can 
relate microscopic variables with macroscopic outputs. But, since a mechanistic 
way to relate these variables is often absent, and it is indeed problematic even 
to quantify phenotype properties [12], it is challenging to even express these 
questions in a proper mathematical setting. Even though we are aware of the 
dangers of suggesting a general frame that is not fully developed, we still believe 
(in view of the difficulty of the problems at hand) that it is worthwhile to attempt 
to set up a language and a set of techniques through which these problems might 
be approached. 

Ideally, we would need to study entire sets of models at once, since we would 
expect the micro/macro maps to be stable under wide variations of the pa- 
rameters in the model of the microscopic dynamics. At the same time, such 
tremendous model variability should be understandable in a compact, possi- 
bly analytical setting to have any hope of providing computationally feasible 
answers. 

In this paper we suggest that supervised learning and the resulting classifi- 
cation functions [11] can provide the basis to formulate at the microscopic level 
questions on the phenotype, such as the centipedes segmentation problem, pro- 
vided that the classification function satisfies some suitable growth conditions. 
The problem of phenotype robustness can then be rephrased as a problem of real 
geometry on a fixed domain. General methods to solve such problems are still 
in their infancy [5] and we propose a generalization of path integrals that allows 
to reduce the problem of class belonging and of phenotype robustness to spe- 
cific questions on functions with complex output. Finally, we will show how to 
reduce the CS problem to a problem on the global properties of these functions. 
Other problems about the restricted variability of phenotypes in developmental 
biology could be formulated in similar ways. 

2 A Geometrical Robustness Condition 

Let X — (Xi , . . . , Xn) be the activation level of a set of proteins (genes, metabo- 
lites, or combinations of) at some time to and assume that we have access to the 
derivatives X — (Xi, . . . , Xn) of those levels at the same time to (in practice 
this means that we measure the proteins at two very close time points); and 
suppose the biological samples from which (X, X) is measured can be classified 
in a set of M classes C\, . . . , Cm- 

We assume the protein measurements are taken at the embryo stage of de- 
velopment of the individuals in the CS problem. Clearly these measurements 
will be some sort of average of the activity levels of several cells [6] , even though 
single cell measurements can be envisioned |13j . We further assume that the 
underlying network of interactions is stable (i.e. no essential parameter varia- 
tion that changes significantly the dynamics) over a short time range. Without 
excluding in principle other possible state variables, we focus our attention on 
protein networks as these are believed to be evolutionary more stable [22] . 
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Call B(X,X) an instance of the biological class associated to (X, X) (in 
the case of the CS problem, this will be the number of pairs of legs of an 
adult individual centipede). We assume that we have access to a training set 
of instances for each class, so that we can build a special type of classifier that 
has the following structure: 

Definition 1 (Interval Classifier) A function F(X,X) is an interval classi- 
fier for the classification problem with classes S m , m — 1, . . . , M , if it satisfies: 
m-l< F(X, X) < m when B(X, X) € C m m = 1, . . . , M. 

Though not strictly necessary at this point, it is useful for our subsequent 
analysis of Propositions 1 and 2 to require that F is bounded at infinity and 
analytic, and therefore we assume for simplicity that the classifier function F is 
a neural network with exponential sigmoidal activation functions (|14j. chapter 
10). 

Remark 1 At a fundamental level, it is not necessarily possible to identify a 
subset of molecular variables that is indeed predictive for the phenotype charac- 
teristic that we are interested in. This problem is not exclusive to our setting, 
but it is a major difficulty in all approaches that try to bridge molecular biology 
with the study of phenotype characteristics. The very existence of an accurate 
classifier F depends on the identification of such variables. Note that our setting 
requires to estimate not only state variables, but their derivatives as well. In 
classical rational mechanics state variables and their first derivative are suffi- 
cient to characterize a system for all future times (in a variational, Lagrangian 
setting [1]). While first derivatives are also sufficient for our formal analysis 
of phenotype classification problems, it would be interesting to understand how 
many derivatives are truly necessary to have effective classifiers for these types 
of problems. 

Note that the way we define the multi-classes classifier F is not the standard 
one, in which an M-classes problem is usually approached by having a vector 
of M output classification functions ([14], page 331). For reasons that will be 
clear when we reinterpret analytically the CS problem in Problem 2 of the last 
section, we not only use a single function F for the multi-class problem, but we 
also require that all instances belonging to a certain class must be within a given 
interval. This superimposes a stronger metrical structure on the classification 
problem. 

Remark 2 In the context of neural networks, the request of interval classifier 
for training instances within each class transforms the unconstrained optimiza- 
tion problem usually associated to finding the classifier function ('141, page 335) 
into a constrained optimization problem. 

Take now a slack variable Y and consider the function F(X, X) — Y, with 
(X, X) 6 D x D, where D is the set of biologically meaningful conditions for X 
and D is the set of biologically meaningful conditions for X. Then if B(X, X) £ 
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Ci there exists Y with < Y < 1 such that F(X,X) - Y = 0, so that the 
condition B(X, X) G C x can be rewritten as F(X, X) - Y = 0, (X, X) G D x L>, 
< y < 1. Similarly, B(X,X) € C m becomes F(X,X) - Y = 0, G 
D x D, m - 1 < Y < m. 

It is reasonable to suppose that X is in fact a state variable of an or- 
dinary differential equation (ODE) network x = f(x,ao), x = (xi, . . . , xn), 
f = (/i, . . . , /jv), / a vector of polynomials in x, modeling ODEs with poly- 
nomials or power functions has proven itself to be very flexible for systems of 
molecular reactions [3T]. We further ask that f(x,a) is an analytic function in 
a. As the condition of analytic structure of the classifier F itself, the analyticity 
of f(x, a) in a will be important in the justification of Proposition 2. 

A network of biological significance will usually depend on a large number 
of parameters that will depend on the environment where the variable of the 
network actually act and live [10] . this is the reason we allow a dependance 
from the parameter vector a in the ODE network. Write the dependence of x 
from f{x,a) explicitly in F(X,X) - Y = 0, i.e., F(X,f(X,a)) - Y = 0. The 
condition of class belonging can be written as: 

Definition 2 (Network Classification) A network x — f(x, do) generates 
phenotypes belonging to class C' m if 

3X eD, m-KY <m : F(X, f(X, a )) - Y = 0. (1) 

Equation ([lj is a condition for the network x = /(a;, oo) to give rise to states 
that belong to one of the classes we are considering. Note that the domain D 
of X constrains the domain D through the relation D — f(D). 

The macroscopic phenotypic states of an organism are believed to be robust 
under wide ranging changes of parameters [10] . Therefore, for a realistic net- 
work, ([T]) should be satisfied for all parameters in a region A, where A is some 
sizable neighborhood around a nominal value a = oq of the parameter a. In 
other words: 

Definition 3 (Class Robustness) A phenotype class C m is robust if the zeros 
of the function F(X, f(X, a)) — Y = are persistent in a region A of parameters, 
i. e. : 

Ma G A, 3X G D, m - 1 < Y < m : F{X, f(X, a))-Y = 0. (2) 

Remark 3 We assume that, for each a, x = f(x, a) is capable of generating 
(x, x) belonging to a single class C m , to avoid, in the CS problem, the paradoxical 
situation in which the predicted number of pairs of legs can change in a given 
centipede with time. We assume instead that the embryo is committed to its 
specific segmentation within a large time frame where we could measure our 
state variables. 

3 Stable Zeros and Path Integrals 

In the previous section we described a condition ([2]) that must be satisfied if 
x = f(x, a) is to generate robustly states that belong to a class C m . The problem 
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with this condition is that it requires identification of zeros of a (non-algebraic) 
function over a real domain, and moreover it requires us to establish that these 
zeros are stable under a wide variation of parameters. This is problematic as it 
is difficult to establish the existence of solutions of real equations on domains, 
even in the algebraic case ([3]). 

In this section we show that a generalized path integral [5] can be built in 
such a way that a specific condition on this integral corresponds to the verifi- 
cation or falsification of over a domain. Path integrals have the remarkable 
property of giving information, in a single analytical object, about global, col- 
lective properties of physical systems, a point of view especially stressed in 
condensed matter field theory literature [2], and it is this ability that we will 
try to mirror in the setting of network analysis. We start by building a path in- 
tegral that is related to ([]} . Essentially, we will build a domain G and a function 
L such that if ([T]) is satisfied, then there is at least a path connecting 2 points 
in G. This path (and a small tubular neighborhood thereof, with squeezed end- 
points) will dominate the path integral that we are building and it will allow us 
to make qualitative conclusions on the value of the integral when ([!} is verified. 
We mirror then this analysis for ([2]) . We first go through the technical building 
of the path integral, before we explain its heuristic interpretation. 

The condition x = (xi, . . . , xn) € D in |T]) can be explicitly written as 
a condition on each variable, i.e. d n b < x n < d nt , where d n b and d n t are 
lower and upper bounds on the biologically meaningful values that variable x n 
can assume; in principle these values can be measured over repeated in vitro 
experiments. We can always change variables x n — > x n so that —1 < i n < 1. 
This is accomplished by setting: 

2 

~, ~~i i^n drib) 1 

Similarly, we can force — 1 < y < 1 by setting y — 2(y — (to — 1)) — 1. These 
are invertible linear transforms, so we can write 

y - 1 

3x,y : F(x, f(x,a))- (— h(m-l)) = 0, -1 < x n < 1, -1 < y < 1 a = a . 

(3) 

for some transformed functions F, f — (f±, /at) which are obtained from 
F and / by replacing x with x and y with y. We further simplify notation by 
defining the analytical function 

H(x, y, a, to) = F(x, f(x, a)) - (^ + (to - 1)) (4) 
So we can rewrite condition as: 

3x, y : H(x, y, a, m) 2 = 0, on — 1 < x n < 1, — 1 < y < 1, a = clq. (5) 

We square the function H for purposes that will be clear in the following 
(see equation and its justification) . The domain restriction on x n and y can 
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also be written as (x, y) € S\ X • • • X <Si where 1S1 is the unit interval [—1,1] 
and we take the cartesian product N + 1 times. We now introduce a spherical 
extension of this domain in such a way that on every section of the extended 
sphere we can formulate a condition similar to ([5])- 

To build the spherical extension, first suppose we work with a single variable 
x n , keeping all other variables constant. We embed each point x n in the disk S\ 
in the space R 2 with the map x n — > (x n , 0) . We want then a basic way to map 
points (x n ,0) on (Si, 0) to points (x nz , z) in the slices (*, z) for z in — 1 < z < 1, 
and moreover we want the whole set (Si, 0) to be mapped to the points (—1,0), 
(1, 0) in the limit of z — > ±1. 

One way to achieve this embedding is through maps x nz — x n \/\ — z 2 . 
Conversely, any point (x nz ,z), in V n = {—1 < z < 1, —y/l — z 2 < x nz < 
yjl — z 2 } can be mapped to a point in Si by setting i n = x nz ^j=g ■ If we do 
a similar mapping for all x n , and for y as well, the function H(x,y,a,m) 2 can 
be extended to the following function of variables (x z , y z , z, a, m): 

L(x z ,y z ,z,a,rn) = H(x z —=L=,y z —=L= 7 a,m) 2 (6) 

Vl —2 V 1 — z z 

where x z = (x\ z , Sjvz), and Y z 6 Vy with = {—1 < z < 1, — vl — < 
y z < vl - z 2 }. The same value of z is used to define all components of i,z and 
y z , so we can define the domain of all points in the spherical extension as: 



V = {(x z ,y z ,z), -Kz<l, -v 7 ! - z 2 ) <x nz <^/l- z 2 , - y/l - z 2 < y z < y/l - z 2 }. 

We are now ready to introduce the generalization of the path integral that 
we aimed for: 

Definition 4 (Geometric Path Integral) Let ^(t), < t < 1 be a path in 
T> with t(0) = (0,0,-1) and 7(1) = (0,0,1). LetT be the set of all such paths, 
and let D7 be a suitable measure on V . We define the geometric path integral 
associated to condition jSJ, and dependent on a parameter h > 0, as: 

P(a,m,h)= [ e *J'o i (7(t),a ) fi0«M£) 7 (7) 
J-ter 

In general the integrand L would not be defined at the extremes of the paths 
7. But since we chose it to be the square of a (linear transformation of a) 
neural network classifier F with exponential sigmoidal activation functions, then 
for each path 7 the integral is well defined as z —¥ ±1, since the exponential 
flattening of the sigmoidal functions (|14j. page 225) assure that the limits of 
L for z — > ±1, with extension maps x nz — x n \J\ — z 2 , and all other variables 
fixed, do actually exists. 

Remark 4 The choice of the appropriate measure Dj that ensures convergence 
of path integrals is very delicate and it will require further investigation in the 
context of geometric path integrals for appropriate classes of integrands L. Note, 
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however, that the geometrical path integral is defined on compact sets, and this 
is a scenario where the standard path integrals are amenable to rigorous conver- 
gence results fThy . 

In order to understand the motivation for the integral in (JTJ) , we consider the 
case in which L (and therefore H) has a set of zeros in T)f]{(x z , y z , z) : z = 0}, 
specifically we can assume that L(xo, yo, 0, a, m) — 0. We can then build a full 
path j in V such that L(<y(t),a, m) = for every t in [0,1], just by taking 
suitable mappings of (xq, yo) m T> for all values of — 1 < z < 1. 

Now, following standard heuristic arguments for semi-cassical approxima- 
tions of path integrals [9], [2] chapter 3, [20] we expect the following result to 
hold. We denote by and 9/ the real and imaginary part of / respectively. 

Proposition 1 (Geometric Path Integral Real Dominance Conditions) 

// the network x — f(x,a) can generate states belonging to class C m , then 
tfl{P(a,m,h)) > 0, 5R(P(a, m, h)) » 3(P(a, m, h)) for all positive values h 
sufficiently close to zero. 

Heuristic Justification: Since L — H 2 in ([6]) is a quadratic functions, and H 
is linear in y, all the first derivatives of L vanish only when L itself is zero. 
Moreover, if x = f{x,a) can generate states belonging to class C m , from Defi- 
nition 2 we know that F(X, f(X, a)) — Y = has a solution in the domain that 
establishes class belonging, and therefore H = has an appropriate solution as 
well (see (|4|) and (0)). This implies that there is a path 70 € V such that L is 
identically zero on 70, and that the functional S(j,a,m) — J Q L(^(t),a,m)dt 
has first order functional derivatives equal to zero as well, at 7 = 70. The 
path 70 is therefore an extremal path for #(7, a, m). In the limit of h — > 0, 
the extremal paths, and quadratic fluctuations around them, will dominate the 
geometric path integral, since all other non-extremal contributions to P(a, m, h) 
will mostly cancel each other out because of the much faster phase interference of 
the corresponding exponential integrals in P(a,m,h). For near-extremal paths 
in a neighborhood of extremal paths, we have ^0 L(-y{t),a,m)dt ^ e £ f Q odt w 
and they will provide a large, real positive contribution to P(a,m, h), so that 
P(a,m, h) rs p + iq with p positive and p » q, if h is sufficiently small, pos- 
sibly up to a multiplicative phase factor that depends only on quadratic local 
fluctuations of near-extremal paths around the paths for which 5(7, a, m) = 

Remark 5 In standard path integrals, there may be a different change of phase 
for each of the individual contributions of extremal paths to the overall integral 
C[20\j chapter 17). Essentially, this is due to the fact that extremal paths may 
not be globally minima of the functionals that replace S{^, a, m) in standard path 
integrals. No such problem arises for the extremal paths used in our heuristic 
justification, since they all achieve the very minimum (zero) value allowed for 
the functional S("f, a, m) itself. 
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Remark 6 For geometric path integrals, the extremal paths are not isolated, 
when they exist. This may require techniques from functional field integrals (i.e. 
higher dimensional path integrals, see '2J, chapter 4) for the actual derivation 
of semi-classical types of approximations in the limit h — > 0. 

The computation of P(a, to, h) is a global approach to identify zeros of func- 
tions in a specific real domain. Indeed there is a dependence on the original 
domain of biologically meaningful conditions that is hidden in the definition 
of the function L. However, we really want to know whether these zeros are 
persistent in a full measure subset A of a domain A of parameters. Because of 
this additional requirement, we need one more step before we can fully express 
the condition ([5} with the geometric path integral formalism. This is achieved 
by taking an ordinary integral of a function of P(a, m, h) with respect to the 
parameter vector a, in the domain A where we want to enforce robustness as in 
©• 

Definition 5 (Robustness Function) The robustness function R(m, h) as- 
sociated to class C m is, for h > 0: 

R(m,h)= [ P(a,m,h)e-i^ p(a - m < hy)2 da (8) 

J A 

This definition of robustness may formally remind the reader of the one 
proposed by Kitano in 17] ■ The two proposals, however, are substantially 
different, since Kitano considers a space of perturbations and defines a measure 
of robustness through integration on that space. 

What is crucial for our interpretation of the CS problem is the fact that 
R(m, h) inherits the real dominance conditions from P(a, to, h), namely: 

Proposition 2 ( Robustness Function Real Dominance Conditions) If 

a phenotype class C m is robust for a region of parameters A C A, then the robust- 
ness function R(m, h) satisfies the real dominance conditions, i.e., 5R(i?(m, h)) » 
3(iJ(m, h)), and ^R(R(m,h)) > for all positive values h sufficiently close to 
zero. 

Heuristic Justification: We make the assumption that the imaginary part of 
P(o, to, h) goes to zero fast enough as h — > if P(a, to, h) satisfies the real 
dominant conditions, more particularly we assume \3P(a,m,h)\ Ri with 
e > for h small. Now, from Definition 3, if a phenotype class C m is robust, 
then there are persistent zeros of the function F(X, f(X, a)) — Y = in the ap- 
propriate domain, and P(a,m,h) satisfies 3?(P(a, to, h)) » 3(P(a, to, h)) and 
5R(P(a, to, h)) > for all a in some region A C A. Therefore for a in such region 
A, P(a,m,h) will give large, real positive contributions to R(m,h) for h that 
goes to zero, since the exponential in will converge to 1. Suppose instead we 
are in a region A C A where the functional S'(7, a, m) = J Q L(^(t), a, m)dt has 
no extremal paths for all a E A. Note that, reverting to a coordinate representa- 
tion for L, for every small Aa, L(x z , y z , z, a, m) and L(x z , y z , z,a + Aa, m) will 
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be equal on at most a finite number of points in T>, since we asked that f(x, a) 
was analytical in a, and F is also assumed analytical in its arguments. The dif- 
ferences, small, but located almost everywhere in V, between L(x z ,y z , z,a,m) 
and L(x z , y z , z, a+Aa, m) will be enhanced in the limit of h leading to large 
differences in the phases of P(a, m, h) and P(a + Aa, m, h). Therefore nearby 
geometric path integrals inACi will have uncorrelated phases for h that goes 
to 0. In particular, for each h, the set of points in A for which QP(a, m, h) = 
is of measure zero, and therefore this set can be removed when computing the 
integral in flSJ). For all remaining a £ A, the exponential in © will suppress 
to the contribution of the corresponding P(a, m, h) to R(m, h) in the limit 
h — > 0. We can conclude that the contributions to R(m, h) from path integrals 
in A will be subject to strong phase interference, and also that their individual 
contributions to R(m, h) will have norm that converges to 0. Putting together 
this result with the real dominant contributions from regions A of A for which 
5(7, a, m) has extremal paths, we conclude that R(m, h) will satisfy the real 
dominant conditions for all h sufficiently close to 0. 

Remark 7 While the real dominant conditions of propositions 1 and 2 are only 
necessary conditions to the existence of zeros and persistent zeros for H re- 
spectively, these conditions are likely to be sufficient for a generic H . In the 
absence of extremal paths for #(7, a, m), it is unlikely that real dominant con- 
ditions would hold for all h sufficiently close to zero, as in the limit the phase 
of P(a,m,h) becomes increasingly uncorrelated as a function of both h and a. 

Also, if e -7r( ap K m >' 1 )) 2 in © is substituted by e^ ( ^fe^T )2 , the condition 
|3(P(a, m, h))\ R3 h 1 ^ 2+e in the justification of Proposition 2 could be substi- 
tuted by the weaker \^(P(a,m,h))/$t(P(a,m,h))\ rs h}^ 2+<i , at the price of a 
slightly more complicated argument. 

Remark 8 The real dominance conditions for geometric path integrals seem to 
portend a method to establish the existence of solutions of equations (in particu- 
lar real equations) in bounded domains, that does not depend on constraints on 
the signs of first derivatives. More specifically, suppose that we want to know 
whether g{x) = has zeros in a domain D, then we can substitute the function 
H in ^ with H(x,y) = (g(x) — y) 2 (we have no dependence from a and m in 
this setting, and no change of variables). The boundaries for x can be inferred 
directly from D and we take y in the domain D y (e) — {y : — e < y < e} for 
e > 0. We can use this function H(x, y) in the definition of the geometric path 
integral, so that the partial derivatives of the corresponding function L in ([7jl 
are all zeros only when g(x) — y — 0. Therefore, if g(x) has zeros in the domain 
D, then g(x) — y — at least for some (x, y) G D x D y {e), and a real dominance 
condition on the geometric path integral will hold on D x D y (e) for all e suffi- 
ciently small. We will explore this important application of our technique in a 
subsequent paper. 
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4 Centipedes Segmentation Problem Reinterpreted 



We come back now to the problem that motivated this work. How to interpret 
a pattern of allowed changes in phenotype on the basis of the structure of the 
underlying molecular network? In the context of the centipedes' segmentation 
problem, we assume that the network x — f(x,a) is essentially the same for 
all species of centipedes, with only the set of parameters a changing from one 
species to the other. This assumption is not unreasonable, if we think that 
the same species of centipedes can display different individuals with different 
number of segments, showing that there is, in the same network the potential for 
variable segmentations. Moreover, we would expect the process of segmentation 
to be evolutionary stable ([19]. page 53). 

Classify now the networks in such a way that the classification function 
mirrors the quantitative phenotype structure. In the setting of the CS problem, 
if x = f(x, a) gives rise to a phenotype with 15 pairs of legs, we assume that 15 — 
1 < F(X,X) < 15 for all states (X, X) arising from that network. Effectively, 
we treat F as a nonlinear regression model, which predicts the number of pairs 
of legs from state variables. Except for the fact that we do not simply want to 
know what is the output of F under a specific input (X, X), but we must assure 
that some suitable (X, X) can be generated stably from the network x — f(x, a). 
We use the robustness function R to formulate a quantitative version of the CS 
problem as follows: 

Problem 2 (CS Problem Reinterpreted) Let x = f(x,a) be an analytic 
network, polynomial in x, with a G A and x G D that describes the molecular 
dynamics of relevant signaling compounds in centipedes' embryos. Show that 
there is no interval classifier F , with growth conditions compatible with the 
geometric path integral definition in 1(7]). and trained on a set of actual data 
for known centipedes segmentation classes C mi , . . . , C mk , such that, for all h 
sufficiently close to zero, R(m, h) satisfies the real dominant conditions for m 
even, m = 17, 19. 

We assume that the integral defining R is taken over a very large domain 
A, so that we can suppose that different segmentation phenotypes correspond 
to different regions of parameters within A. 

According to the network path integral formalism, Problem 2 is equivalent 
to stating that it is not possible to find sizable volumes of parameters in A such 
that x = f(x, a) always gives rise to state variables (X, X) such that F{X, X) is 
even or equal to 17, 19; when F(X, X) is trained to properly predict the allowed, 
known number of pairs of legs of centipedes. The quantitative interpretation of 
the CS problem is not dependent on a specific classifier, and it rather enforces 
some properties on any classifier that we may derive from experimental data. 

We defined interval classifiers in Section 2 exactly to be able to achieve this 
compact interpretation of the CS problem, the function R(m, h) is linked to the 
corresponding class C m just by the single parameter m, that, in principle, could 
be treated now as a continuous variable. 
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Remark 9 The geometric path integral formulation of the CS problem allows 
us to comment on some essential differences between mathematization on biol- 
ogy and in physics. The theoretical tools used to solve problems can be similar 
in the two fields, as we suggested with the development of the geometric path 
integral formalism. But in the biological setting we lack the ability to unify our 
understanding of multiple problems: the functionals in the geometric path inte- 
gral interpretation of the CS problem are derived from the classifiers found with 
supervised learning, and therefore they are not amenable to interpretation. It 
is as if every question that we may ask about phenotypes requires its own the- 
ory, and associated geometric path integral, not reducible, even in principle, to 
simpler geometric path integrals. 

5 Challenges Ahead 

In this section we highlight some of the major problems that need to be ad- 
dressed regarding geometric path integrals and their applications!]]. 

First, it is to be seen how known analytical techniques to evaluate and 
approximate path integrals |18j apply to the highly non-standard geometrical 
path integrals derived from biological classification problems. At the very least, 
we would expect numerical approximation of these integrals to be possible, 
and hopefully less computationally intensive than an actual resolution of the 
associated geometrical problem in ([5]), especially for very large domains. 

Moreover, path and functional field integrals are powerful qualitative tools 
to describe the global state of large systems [2J, and similar methods for geo- 
metric path integrals may allow to rule out real dominant conditions for entire 
families of functions H. In particular, to approach the geometric path integral 
interpretation of the CS problem, we would need methods that can constrain ef- 
fectively the sign of the real part of R(m, h) for large spaces of classifiers trained 
on a set of experimental data. It would also be important to develop the theory 
of geometric path integrals to allow for a precise estimate of the size of the 
parameters for which |5J is satisfied. This parameter size can vary for different 
classes, and therefore a careful estimate could be used to compare the relative 
robustness of different classes. 

We point out that a network may be constrained by several classifier func- 
tions, if different phenotype characteristics are dependent on it. The function 
L in the geometric path integral ([7]) can be extended to these cases by taking 
a sum of squares of the classifier functions where each of them requires the in- 
troduction of a new slack variable, and all heuristic arguments that lead to the 
real dominance conditions can be repeated in this generalized case as well. 

Finally, we note that if the classifier function F is fixed, it is possible to 
ask questions on the topology of the networks x = f(x,a) that are compatible 
with the real dominance conditions for each specific class. In other words, the 
analytical structure of suitable geometric path integrals may encode and shed 

1 Refer to Remarks 5 an 6 for outstanding issues related to the definition of geometric path 
integrals. 
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light on the structure of the topologies of molecular networks that are compatible 
with some given phenotypic outcomes. 
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